Others - a sbarman25 Collection

sbarman25 's Collections

Training & Architectures

Models

Safety / Alignment / Policies / SMI

Evals & Monitoring

Spaces

Agentic

Vulnerabilities

CV / Text-to-Image / Image-to-Image / Diffusion

Others

Hardware-aware Models

Tool Usage (w/VLMs)

Vision Language Models

Others

updated Jul 25, 2024

Masked Autoencoders Are Scalable Vision Learners

Paper • 2111.06377 • Published Nov 11, 2021 • 3

Note Papers (unrelated to above): 📰 Solving olympiad geometry without human demonstrations https://www.nature.com/articles/s41586-023-06747-5 https://deepmind.google/discover/blog/alphageometry-an-olympiad-level-ai-system-for-geometry/
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 58
distil-whisper/distil-large-v2

Automatic Speech Recognition • Updated Mar 6 • 8.02k • 509
Seven Failure Points When Engineering a Retrieval Augmented Generation System

Paper • 2401.05856 • Published Jan 11, 2024 • 2
ColPali: Efficient Document Retrieval with Vision Language Models

Paper • 2407.01449 • Published Jun 27, 2024 • 48